Trustworthy Reasoning: Evaluating and Enhancing Factual Accuracy in LLM Intermediate Thought Processes
arxiv.org·2d
The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models
arxiv.org·2d
SketchMind: A Multi-Agent Cognitive Framework for Assessing Student-Drawn Scientific Sketches
arxiv.org·2d
CoGrader: Transforming Instructors' Assessment of Project Reports through Collaborative LLM Integration
arxiv.org·5d
Loading...Loading more...